Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 5806 |
| Missing cells | 7876 |
| Missing cells (%) | 10.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 589.8 KiB |
| Average record size in memory | 104.0 B |
Variable types
| Numeric | 6 |
|---|---|
| Text | 5 |
| Categorical | 2 |
age_certification is highly overall correlated with runtime and 1 other fields | High correlation |
df_index is highly overall correlated with release_year and 1 other fields | High correlation |
release_year is highly overall correlated with df_index and 1 other fields | High correlation |
runtime is highly overall correlated with age_certification and 1 other fields | High correlation |
seasons is highly overall correlated with df_index and 2 other fields | High correlation |
type is highly overall correlated with age_certification and 2 other fields | High correlation |
age_certification has 2610 (45.0%) missing values | Missing |
seasons has 3759 (64.7%) missing values | Missing |
imdb_id has 444 (7.6%) missing values | Missing |
imdb_score has 523 (9.0%) missing values | Missing |
imdb_votes has 539 (9.3%) missing values | Missing |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
id has unique values | Unique |
Reproduction
| Analysis started | 2023-11-27 20:41:58.878777 |
|---|---|
| Analysis finished | 2023-11-27 20:42:06.659915 |
| Duration | 7.78 seconds |
| Software version | ydata-profiling vv4.6.2 |
| Download configuration | config.json |
df_index
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 5806 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2902.5 |
| Minimum | 0 |
|---|---|
| Maximum | 5805 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 290.25 |
| Q1 | 1451.25 |
| median | 2902.5 |
| Q3 | 4353.75 |
| 95-th percentile | 5514.75 |
| Maximum | 5805 |
| Range | 5805 |
| Interquartile range (IQR) | 2902.5 |
Descriptive statistics
| Standard deviation | 1676.1922 |
|---|---|
| Coefficient of variation (CV) | 0.57749945 |
| Kurtosis | -1.2 |
| Mean | 2902.5 |
| Median Absolute Deviation (MAD) | 1451.5 |
| Skewness | 0 |
| Sum | 16851915 |
| Variance | 2809620.2 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 3877 | 1 | < 0.1% |
| 3875 | 1 | < 0.1% |
| 3874 | 1 | < 0.1% |
| 3873 | 1 | < 0.1% |
| 3872 | 1 | < 0.1% |
| 3871 | 1 | < 0.1% |
| 3870 | 1 | < 0.1% |
| 3869 | 1 | < 0.1% |
| 3868 | 1 | < 0.1% |
| Other values (5796) | 5796 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 5805 | 1 | |
| 5804 | 1 | |
| 5803 | 1 | |
| 5802 | 1 | |
| 5801 | 1 | |
| 5800 | 1 | |
| 5799 | 1 | |
| 5798 | 1 | |
| 5797 | 1 | |
| 5796 | 1 |
id
Text
UNIQUE 
| Distinct | 5806 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.5 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.8024457 |
| Min length | 3 |
Characters and Unicode
| Total characters | 45301 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5806 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | ts300399 |
|---|---|
| 2nd row | tm84618 |
| 3rd row | tm127384 |
| 4th row | tm70993 |
| 5th row | tm190788 |
| Value | Count | Frequency (%) |
| ts300399 | 1 | < 0.1% |
| tm70993 | 1 | < 0.1% |
| ts22164 | 1 | < 0.1% |
| tm14873 | 1 | < 0.1% |
| tm185072 | 1 | < 0.1% |
| tm98978 | 1 | < 0.1% |
| tm119281 | 1 | < 0.1% |
| tm67378 | 1 | < 0.1% |
| tm44204 | 1 | < 0.1% |
| tm69778 | 1 | < 0.1% |
| Other values (5796) | 5796 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 5806 | |
| 1 | 4122 | |
| 2 | 4098 | |
| m | 3759 | |
| 4 | 3643 | |
| 3 | 3641 | |
| 8 | 3641 | |
| 5 | 3079 | 6.8% |
| 0 | 2908 | 6.4% |
| 9 | 2900 | 6.4% |
| Other values (3) | 7704 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33689 | |
| Lowercase Letter | 11612 | 25.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4122 | |
| 2 | 4098 | |
| 4 | 3643 | |
| 3 | 3641 | |
| 8 | 3641 | |
| 5 | 3079 | |
| 0 | 2908 | |
| 9 | 2900 | |
| 7 | 2889 | |
| 6 | 2768 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5806 | |
| m | 3759 | |
| s | 2047 | 17.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 33689 | |
| Latin | 11612 | 25.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4122 | |
| 2 | 4098 | |
| 4 | 3643 | |
| 3 | 3641 | |
| 8 | 3641 | |
| 5 | 3079 | |
| 0 | 2908 | |
| 9 | 2900 | |
| 7 | 2889 | |
| 6 | 2768 |
Latin
| Value | Count | Frequency (%) |
| t | 5806 | |
| m | 3759 | |
| s | 2047 | 17.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45301 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 5806 | |
| 1 | 4122 | |
| 2 | 4098 | |
| m | 3759 | |
| 4 | 3643 | |
| 3 | 3641 | |
| 8 | 3641 | |
| 5 | 3079 | 6.8% |
| 0 | 2908 | 6.4% |
| 9 | 2900 | 6.4% |
| Other values (3) | 7704 |
title
Text
| Distinct | 5751 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 45.5 KiB |
Length
| Max length | 104 |
|---|---|
| Median length | 61 |
| Mean length | 17.854608 |
| Min length | 1 |
Characters and Unicode
| Total characters | 103646 |
|---|---|
| Distinct characters | 154 |
| Distinct categories | 16 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 7 ? |
Unique
| Unique | 5699 ? |
|---|---|
| Unique (%) | 98.2% |
Sample
| 1st row | Five Came Back: The Reference Films |
|---|---|
| 2nd row | Taxi Driver |
| 3rd row | Monty Python and the Holy Grail |
| 4th row | Life of Brian |
| 5th row | The Exorcist |
| Value | Count | Frequency (%) |
| the | 1511 | 8.3% |
| of | 445 | 2.5% |
| a | 265 | 1.5% |
| in | 201 | 1.1% |
| 179 | 1.0% | |
| and | 140 | 0.8% |
| to | 138 | 0.8% |
| love | 125 | 0.7% |
| my | 102 | 0.6% |
| 2 | 71 | 0.4% |
| Other values (6735) | 14940 |
Most occurring characters
| Value | Count | Frequency (%) |
| 12314 | 11.9% | |
| e | 9696 | 9.4% |
| a | 7391 | 7.1% |
| o | 5895 | 5.7% |
| i | 5735 | 5.5% |
| n | 5413 | 5.2% |
| r | 5412 | 5.2% |
| t | 4753 | 4.6% |
| s | 4091 | 3.9% |
| h | 3680 | 3.6% |
| Other values (144) | 39266 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 72233 | |
| Uppercase Letter | 16364 | 15.8% |
| Space Separator | 12314 | 11.9% |
| Other Punctuation | 1989 | 1.9% |
| Decimal Number | 504 | 0.5% |
| Dash Punctuation | 141 | 0.1% |
| Other Letter | 42 | < 0.1% |
| Open Punctuation | 15 | < 0.1% |
| Close Punctuation | 15 | < 0.1% |
| Math Symbol | 10 | < 0.1% |
| Other values (6) | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9696 | |
| a | 7391 | |
| o | 5895 | 8.2% |
| i | 5735 | 7.9% |
| n | 5413 | 7.5% |
| r | 5412 | 7.5% |
| t | 4753 | 6.6% |
| s | 4091 | 5.7% |
| h | 3680 | 5.1% |
| l | 3511 | 4.9% |
| Other values (40) | 16656 |
Other Letter
| Value | Count | Frequency (%) |
| า | 6 | 14.3% |
| ว | 3 | 7.1% |
| น | 3 | 7.1% |
| ร | 2 | 4.8% |
| ล | 2 | 4.8% |
| 糖 | 2 | 4.8% |
| 요 | 1 | 2.4% |
| 타 | 1 | 2.4% |
| 스 | 1 | 2.4% |
| 버 | 1 | 2.4% |
| Other values (20) | 20 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1864 | 11.4% |
| S | 1505 | 9.2% |
| M | 1216 | 7.4% |
| B | 1105 | 6.8% |
| C | 1040 | 6.4% |
| A | 995 | 6.1% |
| L | 814 | 5.0% |
| D | 809 | 4.9% |
| H | 720 | 4.4% |
| P | 646 | 3.9% |
| Other values (19) | 5650 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1108 | |
| ' | 272 | 13.7% |
| . | 150 | 7.5% |
| & | 128 | 6.4% |
| , | 115 | 5.8% |
| ! | 108 | 5.4% |
| ? | 49 | 2.5% |
| * | 25 | 1.3% |
| / | 13 | 0.7% |
| # | 9 | 0.5% |
| Other values (7) | 12 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 130 | |
| 0 | 81 | |
| 1 | 76 | |
| 3 | 49 | 9.7% |
| 4 | 35 | 6.9% |
| 9 | 35 | 6.9% |
| 8 | 26 | 5.2% |
| 6 | 25 | 5.0% |
| 5 | 24 | 4.8% |
| 7 | 23 | 4.6% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ้ | 2 | |
| ั | 1 | |
| ิ | 1 | |
| ่ | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 135 | |
| – | 6 | 4.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 8 | |
| ~ | 2 | 20.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | |
| ” | 1 | 14.3% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 | |
| ² | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 12314 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 15 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 15 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 88590 | |
| Common | 15002 | 14.5% |
| Thai | 30 | < 0.1% |
| Hangul | 11 | < 0.1% |
| Cyrillic | 7 | < 0.1% |
| Han | 6 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9696 | 10.9% |
| a | 7391 | 8.3% |
| o | 5895 | 6.7% |
| i | 5735 | 6.5% |
| n | 5413 | 6.1% |
| r | 5412 | 6.1% |
| t | 4753 | 5.4% |
| s | 4091 | 4.6% |
| h | 3680 | 4.2% |
| l | 3511 | 4.0% |
| Other values (62) | 33013 |
Common
| Value | Count | Frequency (%) |
| 12314 | ||
| : | 1108 | 7.4% |
| ' | 272 | 1.8% |
| . | 150 | 1.0% |
| - | 135 | 0.9% |
| 2 | 130 | 0.9% |
| & | 128 | 0.9% |
| , | 115 | 0.8% |
| ! | 108 | 0.7% |
| 0 | 81 | 0.5% |
| Other values (31) | 461 | 3.1% |
Thai
| Value | Count | Frequency (%) |
| า | 6 | |
| ว | 3 | 10.0% |
| น | 3 | 10.0% |
| ร | 2 | 6.7% |
| ล | 2 | 6.7% |
| ้ | 2 | 6.7% |
| ส | 1 | 3.3% |
| เ | 1 | 3.3% |
| ม | 1 | 3.3% |
| ง | 1 | 3.3% |
| Other values (8) | 8 |
Hangul
| Value | Count | Frequency (%) |
| 요 | 1 | |
| 타 | 1 | |
| 스 | 1 | |
| 버 | 1 | |
| 법 | 1 | |
| 마 | 1 | |
| 캐 | 1 | |
| 치 | 1 | |
| 티 | 1 | |
| 니 | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 1 | |
| т | 1 | |
| Т | 1 | |
| р | 1 | |
| и | 1 | |
| к | 1 | |
| а | 1 |
Han
| Value | Count | Frequency (%) |
| 糖 | 2 | |
| 料 | 1 | |
| 幸 | 1 | |
| 福 | 1 | |
| 理 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103465 | |
| None | 111 | 0.1% |
| Thai | 30 | < 0.1% |
| Punctuation | 16 | < 0.1% |
| Hangul | 11 | < 0.1% |
| Cyrillic | 7 | < 0.1% |
| CJK | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 12314 | 11.9% | |
| e | 9696 | 9.4% |
| a | 7391 | 7.1% |
| o | 5895 | 5.7% |
| i | 5735 | 5.5% |
| n | 5413 | 5.2% |
| r | 5412 | 5.2% |
| t | 4753 | 4.6% |
| s | 4091 | 4.0% |
| h | 3680 | 3.6% |
| Other values (73) | 39085 |
None
| Value | Count | Frequency (%) |
| é | 24 | |
| í | 20 | |
| á | 15 | |
| ñ | 11 | |
| ó | 10 | |
| ü | 4 | 3.6% |
| ú | 3 | 2.7% |
| ı | 3 | 2.7% |
| ô | 3 | 2.7% |
| · | 2 | 1.8% |
| Other values (15) | 16 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | |
| – | 6 | |
| … | 2 | 12.5% |
| “ | 1 | 6.2% |
| ” | 1 | 6.2% |
Thai
| Value | Count | Frequency (%) |
| า | 6 | |
| ว | 3 | 10.0% |
| น | 3 | 10.0% |
| ร | 2 | 6.7% |
| ล | 2 | 6.7% |
| ้ | 2 | 6.7% |
| ส | 1 | 3.3% |
| เ | 1 | 3.3% |
| ม | 1 | 3.3% |
| ง | 1 | 3.3% |
| Other values (8) | 8 |
CJK
| Value | Count | Frequency (%) |
| 糖 | 2 | |
| 料 | 1 | |
| 幸 | 1 | |
| 福 | 1 | |
| 理 | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 1 | |
| т | 1 | |
| Т | 1 | |
| р | 1 | |
| и | 1 | |
| к | 1 | |
| а | 1 |
Hangul
| Value | Count | Frequency (%) |
| 요 | 1 | |
| 타 | 1 | |
| 스 | 1 | |
| 버 | 1 | |
| 법 | 1 | |
| 마 | 1 | |
| 캐 | 1 | |
| 치 | 1 | |
| 티 | 1 | |
| 니 | 1 |
type
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.5 KiB |
| MOVIE | |
|---|---|
| SHOW |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.6474337 |
| Min length | 4 |
Characters and Unicode
| Total characters | 26983 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SHOW |
|---|---|
| 2nd row | MOVIE |
| 3rd row | MOVIE |
| 4th row | MOVIE |
| 5th row | MOVIE |
Common Values
| Value | Count | Frequency (%) |
| MOVIE | 3759 | |
| SHOW | 2047 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| movie | 3759 | |
| show | 2047 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 5806 | |
| M | 3759 | |
| V | 3759 | |
| I | 3759 | |
| E | 3759 | |
| S | 2047 | 7.6% |
| H | 2047 | 7.6% |
| W | 2047 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 26983 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 5806 | |
| M | 3759 | |
| V | 3759 | |
| I | 3759 | |
| E | 3759 | |
| S | 2047 | 7.6% |
| H | 2047 | 7.6% |
| W | 2047 | 7.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26983 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 5806 | |
| M | 3759 | |
| V | 3759 | |
| I | 3759 | |
| E | 3759 | |
| S | 2047 | 7.6% |
| H | 2047 | 7.6% |
| W | 2047 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26983 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 5806 | |
| M | 3759 | |
| V | 3759 | |
| I | 3759 | |
| E | 3759 | |
| S | 2047 | 7.6% |
| H | 2047 | 7.6% |
| W | 2047 | 7.6% |
release_year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 67 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2016.0134 |
| Minimum | 1945 |
|---|---|
| Maximum | 2022 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.5 KiB |
Quantile statistics
| Minimum | 1945 |
|---|---|
| 5-th percentile | 2003 |
| Q1 | 2015 |
| median | 2018 |
| Q3 | 2020 |
| 95-th percentile | 2021 |
| Maximum | 2022 |
| Range | 77 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 7.324883 |
|---|---|
| Coefficient of variation (CV) | 0.0036333503 |
| Kurtosis | 17.057172 |
| Mean | 2016.0134 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -3.5203185 |
| Sum | 11704974 |
| Variance | 53.653911 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2019 | 848 | |
| 2020 | 805 | |
| 2018 | 774 | |
| 2021 | 758 | |
| 2017 | 580 | |
| 2016 | 371 | |
| 2015 | 236 | 4.1% |
| 2022 | 217 | 3.7% |
| 2014 | 160 | 2.8% |
| 2013 | 142 | 2.4% |
| Other values (57) | 915 |
| Value | Count | Frequency (%) |
| 1945 | 1 | |
| 1953 | 1 | |
| 1954 | 2 | |
| 1956 | 1 | |
| 1958 | 1 | |
| 1959 | 1 | |
| 1960 | 1 | |
| 1961 | 1 | |
| 1962 | 1 | |
| 1963 | 1 |
| Value | Count | Frequency (%) |
| 2022 | 217 | 3.7% |
| 2021 | 758 | |
| 2020 | 805 | |
| 2019 | 848 | |
| 2018 | 774 | |
| 2017 | 580 | |
| 2016 | 371 | |
| 2015 | 236 | 4.1% |
| 2014 | 160 | 2.8% |
| 2013 | 142 | 2.4% |
age_certification
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2610 |
| Missing (%) | 45.0% |
| Memory size | 45.5 KiB |
| TV-MA | |
|---|---|
| R | |
| TV-14 | |
| PG-13 | |
| PG | |
| Other values (6) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 3.8288486 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12237 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TV-MA |
|---|---|
| 2nd row | R |
| 3rd row | PG |
| 4th row | R |
| 5th row | R |
Common Values
| Value | Count | Frequency (%) |
| TV-MA | 841 | 14.5% |
| R | 575 | 9.9% |
| TV-14 | 470 | 8.1% |
| PG-13 | 440 | 7.6% |
| PG | 246 | 4.2% |
| TV-PG | 186 | 3.2% |
| G | 131 | 2.3% |
| TV-Y7 | 112 | 1.9% |
| TV-Y | 105 | 1.8% |
| TV-G | 76 | 1.3% |
| (Missing) | 2610 |
Length
| Value | Count | Frequency (%) |
| tv-ma | 841 | |
| r | 575 | |
| tv-14 | 470 | |
| pg-13 | 440 | |
| pg | 246 | 7.7% |
| tv-pg | 186 | 5.8% |
| g | 131 | 4.1% |
| tv-y7 | 112 | 3.5% |
| tv-y | 105 | 3.3% |
| tv-g | 76 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 2244 | |
| T | 1790 | |
| V | 1790 | |
| G | 1079 | |
| 1 | 924 | |
| P | 872 | 7.1% |
| M | 841 | 6.9% |
| A | 841 | 6.9% |
| R | 575 | 4.7% |
| 4 | 470 | 3.8% |
| Other values (5) | 811 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8033 | |
| Dash Punctuation | 2244 | 18.3% |
| Decimal Number | 1960 | 16.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1790 | |
| V | 1790 | |
| G | 1079 | |
| P | 872 | |
| M | 841 | |
| A | 841 | |
| R | 575 | 7.2% |
| Y | 217 | 2.7% |
| N | 14 | 0.2% |
| C | 14 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 924 | |
| 4 | 470 | |
| 3 | 440 | |
| 7 | 126 | 6.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8033 | |
| Common | 4204 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 1790 | |
| V | 1790 | |
| G | 1079 | |
| P | 872 | |
| M | 841 | |
| A | 841 | |
| R | 575 | 7.2% |
| Y | 217 | 2.7% |
| N | 14 | 0.2% |
| C | 14 | 0.2% |
Common
| Value | Count | Frequency (%) |
| - | 2244 | |
| 1 | 924 | |
| 4 | 470 | 11.2% |
| 3 | 440 | 10.5% |
| 7 | 126 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12237 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 2244 | |
| T | 1790 | |
| V | 1790 | |
| G | 1079 | |
| 1 | 924 | |
| P | 872 | 7.1% |
| M | 841 | 6.9% |
| A | 841 | 6.9% |
| R | 575 | 4.7% |
| 4 | 470 | 3.8% |
| Other values (5) | 811 | 6.6% |
runtime
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 205 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.643989 |
| Minimum | 0 |
|---|---|
| Maximum | 251 |
| Zeros | 24 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 44 |
| median | 84 |
| Q3 | 105 |
| 95-th percentile | 140.75 |
| Maximum | 251 |
| Range | 251 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 39.47416 |
|---|---|
| Coefficient of variation (CV) | 0.50839944 |
| Kurtosis | -0.41016214 |
| Mean | 77.643989 |
| Median Absolute Deviation (MAD) | 31 |
| Skewness | 0.22024045 |
| Sum | 450801 |
| Variance | 1558.2093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 129 | 2.2% |
| 90 | 122 | 2.1% |
| 45 | 108 | 1.9% |
| 95 | 105 | 1.8% |
| 100 | 104 | 1.8% |
| 44 | 102 | 1.8% |
| 23 | 92 | 1.6% |
| 25 | 85 | 1.5% |
| 94 | 84 | 1.4% |
| 105 | 83 | 1.4% |
| Other values (195) | 4792 |
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 2 | 4 | 0.1% |
| 3 | 8 | 0.1% |
| 4 | 5 | 0.1% |
| 5 | 7 | 0.1% |
| 6 | 11 | |
| 7 | 5 | 0.1% |
| 8 | 8 | 0.1% |
| 9 | 9 | 0.2% |
| 10 | 12 |
| Value | Count | Frequency (%) |
| 251 | 1 | |
| 240 | 1 | |
| 235 | 1 | |
| 230 | 1 | |
| 229 | 1 | |
| 225 | 2 | |
| 224 | 1 | |
| 217 | 1 | |
| 213 | 1 | |
| 210 | 1 |
genres
Text
| Distinct | 1626 |
|---|---|
| Distinct (%) | 28.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.5 KiB |
Length
| Max length | 96 |
|---|---|
| Median length | 83 |
| Mean length | 26.561488 |
| Min length | 2 |
Characters and Unicode
| Total characters | 154216 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1214 ? |
|---|---|
| Unique (%) | 20.9% |
Sample
| 1st row | ['documentation'] |
|---|---|
| 2nd row | ['crime', 'drama'] |
| 3rd row | ['comedy', 'fantasy'] |
| 4th row | ['comedy'] |
| 5th row | ['horror'] |
| Value | Count | Frequency (%) |
| drama | 2901 | |
| comedy | 2269 | |
| thriller | 1178 | |
| action | 1053 | 7.2% |
| romance | 958 | 6.5% |
| documentation | 910 | 6.2% |
| crime | 891 | 6.1% |
| animation | 665 | 4.5% |
| fantasy | 631 | 4.3% |
| family | 622 | 4.3% |
| Other values (10) | 2548 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 29116 | |
| a | 12769 | 8.3% |
| r | 9521 | 6.2% |
| m | 9454 | 6.1% |
| 8820 | 5.7% | |
| , | 8820 | 5.7% |
| o | 8384 | 5.4% |
| i | 7852 | 5.1% |
| e | 7437 | 4.8% |
| c | 6906 | 4.5% |
| Other values (13) | 45137 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 95848 | |
| Other Punctuation | 37936 | 24.6% |
| Space Separator | 8820 | 5.7% |
| Open Punctuation | 5806 | 3.8% |
| Close Punctuation | 5806 | 3.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12769 | |
| r | 9521 | |
| m | 9454 | |
| o | 8384 | |
| i | 7852 | |
| e | 7437 | |
| c | 6906 | |
| n | 6296 | |
| d | 6080 | |
| t | 6013 | 6.3% |
| Other values (8) | 15136 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 29116 | |
| , | 8820 | 23.2% |
Space Separator
| Value | Count | Frequency (%) |
| 8820 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 5806 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 5806 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 95848 | |
| Common | 58368 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12769 | |
| r | 9521 | |
| m | 9454 | |
| o | 8384 | |
| i | 7852 | |
| e | 7437 | |
| c | 6906 | |
| n | 6296 | |
| d | 6080 | |
| t | 6013 | 6.3% |
| Other values (8) | 15136 |
Common
| Value | Count | Frequency (%) |
| ' | 29116 | |
| 8820 | 15.1% | |
| , | 8820 | 15.1% |
| [ | 5806 | 9.9% |
| ] | 5806 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 154216 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 29116 | |
| a | 12769 | 8.3% |
| r | 9521 | 6.2% |
| m | 9454 | 6.1% |
| 8820 | 5.7% | |
| , | 8820 | 5.7% |
| o | 8384 | 5.4% |
| i | 7852 | 5.1% |
| e | 7437 | 4.8% |
| c | 6906 | 4.5% |
| Other values (13) | 45137 |
| Distinct | 449 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.5 KiB |
Length
| Max length | 42 |
|---|---|
| Median length | 6 |
| Mean length | 6.7917671 |
| Min length | 2 |
Characters and Unicode
| Total characters | 39433 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 312 ? |
|---|---|
| Unique (%) | 5.4% |
Sample
| 1st row | ['US'] |
|---|---|
| 2nd row | ['US'] |
| 3rd row | ['GB'] |
| 4th row | ['GB'] |
| 5th row | ['US'] |
| Value | Count | Frequency (%) |
| us | 2327 | |
| in | 629 | 9.4% |
| gb | 406 | 6.0% |
| jp | 291 | 4.3% |
| fr | 248 | 3.7% |
| 232 | 3.4% | |
| kr | 216 | 3.2% |
| ca | 216 | 3.2% |
| es | 212 | 3.2% |
| de | 139 | 2.1% |
| Other values (98) | 1810 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 12988 | |
| [ | 5806 | |
| ] | 5806 | |
| S | 2655 | 6.7% |
| U | 2454 | 6.2% |
| , | 920 | 2.3% |
| 920 | 2.3% | |
| N | 888 | 2.3% |
| I | 827 | 2.1% |
| R | 744 | 1.9% |
| Other values (26) | 5425 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Punctuation | 13908 | |
| Uppercase Letter | 12987 | |
| Open Punctuation | 5806 | |
| Close Punctuation | 5806 | |
| Space Separator | 920 | 2.3% |
| Lowercase Letter | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2655 | |
| U | 2454 | |
| N | 888 | 6.8% |
| I | 827 | 6.4% |
| R | 744 | 5.7% |
| B | 603 | 4.6% |
| G | 578 | 4.5% |
| E | 535 | 4.1% |
| A | 488 | 3.8% |
| P | 463 | 3.6% |
| Other values (16) | 2752 |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2 | |
| e | 1 | |
| b | 1 | |
| a | 1 | |
| o | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 12988 | |
| , | 920 | 6.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 5806 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 5806 |
Space Separator
| Value | Count | Frequency (%) |
| 920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26440 | |
| Latin | 12993 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2655 | |
| U | 2454 | |
| N | 888 | 6.8% |
| I | 827 | 6.4% |
| R | 744 | 5.7% |
| B | 603 | 4.6% |
| G | 578 | 4.4% |
| E | 535 | 4.1% |
| A | 488 | 3.8% |
| P | 463 | 3.6% |
| Other values (21) | 2758 |
Common
| Value | Count | Frequency (%) |
| ' | 12988 | |
| [ | 5806 | |
| ] | 5806 | |
| , | 920 | 3.5% |
| 920 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39433 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 12988 | |
| [ | 5806 | |
| ] | 5806 | |
| S | 2655 | 6.7% |
| U | 2454 | 6.2% |
| , | 920 | 2.3% |
| 920 | 2.3% | |
| N | 888 | 2.3% |
| I | 827 | 2.1% |
| R | 744 | 1.9% |
| Other values (26) | 5425 |
seasons
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 23 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3759 |
| Missing (%) | 64.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1656082 |
| Minimum | 1 |
|---|---|
| Maximum | 42 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 42 |
| Range | 41 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.6362073 |
|---|---|
| Coefficient of variation (CV) | 1.2173057 |
| Kurtosis | 74.502716 |
| Mean | 2.1656082 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.8673539 |
| Sum | 4433 |
| Variance | 6.9495889 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1187 | 20.4% |
| 2 | 374 | 6.4% |
| 3 | 181 | 3.1% |
| 4 | 116 | 2.0% |
| 5 | 76 | 1.3% |
| 6 | 40 | 0.7% |
| 7 | 16 | 0.3% |
| 8 | 14 | 0.2% |
| 9 | 9 | 0.2% |
| 11 | 7 | 0.1% |
| Other values (13) | 27 | 0.5% |
| (Missing) | 3759 |
| Value | Count | Frequency (%) |
| 1 | 1187 | |
| 2 | 374 | 6.4% |
| 3 | 181 | 3.1% |
| 4 | 116 | 2.0% |
| 5 | 76 | 1.3% |
| 6 | 40 | 0.7% |
| 7 | 16 | 0.3% |
| 8 | 14 | 0.2% |
| 9 | 9 | 0.2% |
| 10 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 42 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 24 | 3 | |
| 21 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 15 | 4 | |
| 14 | 2 |
imdb_id
Text
MISSING 
| Distinct | 5362 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 444 |
| Missing (%) | 7.6% |
| Memory size | 45.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.3060425 |
| Min length | 9 |
Characters and Unicode
| Total characters | 49899 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5362 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | tt0075314 |
|---|---|
| 2nd row | tt0071853 |
| 3rd row | tt0079470 |
| 4th row | tt0070047 |
| 5th row | tt0063929 |
| Value | Count | Frequency (%) |
| tt9266592 | 1 | < 0.1% |
| tt0068562 | 1 | < 0.1% |
| tt0079470 | 1 | < 0.1% |
| tt0070047 | 1 | < 0.1% |
| tt0063929 | 1 | < 0.1% |
| tt0066999 | 1 | < 0.1% |
| tt0058385 | 1 | < 0.1% |
| tt0080453 | 1 | < 0.1% |
| tt0061418 | 1 | < 0.1% |
| tt0060862 | 1 | < 0.1% |
| Other values (5352) | 5352 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 10724 | |
| 1 | 5107 | |
| 0 | 4557 | |
| 8 | 4181 | 8.4% |
| 4 | 4118 | 8.3% |
| 6 | 4117 | 8.3% |
| 2 | 4099 | 8.2% |
| 3 | 3304 | 6.6% |
| 7 | 3298 | 6.6% |
| 5 | 3240 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 39175 | |
| Lowercase Letter | 10724 | 21.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5107 | |
| 0 | 4557 | |
| 8 | 4181 | |
| 4 | 4118 | |
| 6 | 4117 | |
| 2 | 4099 | |
| 3 | 3304 | |
| 7 | 3298 | |
| 5 | 3240 | |
| 9 | 3154 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 10724 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 39175 | |
| Latin | 10724 | 21.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5107 | |
| 0 | 4557 | |
| 8 | 4181 | |
| 4 | 4118 | |
| 6 | 4117 | |
| 2 | 4099 | |
| 3 | 3304 | |
| 7 | 3298 | |
| 5 | 3240 | |
| 9 | 3154 |
Latin
| Value | Count | Frequency (%) |
| t | 10724 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49899 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 10724 | |
| 1 | 5107 | |
| 0 | 4557 | |
| 8 | 4181 | 8.4% |
| 4 | 4118 | 8.3% |
| 6 | 4117 | 8.3% |
| 2 | 4099 | 8.2% |
| 3 | 3304 | 6.6% |
| 7 | 3298 | 6.6% |
| 5 | 3240 | 6.5% |
imdb_score
Real number (ℝ)
MISSING 
| Distinct | 81 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 523 |
| Missing (%) | 9.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5334469 |
| Minimum | 1.5 |
|---|---|
| Maximum | 9.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.5 KiB |
Quantile statistics
| Minimum | 1.5 |
|---|---|
| 5-th percentile | 4.5 |
| Q1 | 5.8 |
| median | 6.6 |
| Q3 | 7.4 |
| 95-th percentile | 8.2 |
| Maximum | 9.6 |
| Range | 8.1 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.1609316 |
|---|---|
| Coefficient of variation (CV) | 0.17769052 |
| Kurtosis | 0.78618261 |
| Mean | 6.5334469 |
| Median Absolute Deviation (MAD) | 0.8 |
| Skewness | -0.65989633 |
| Sum | 34516.2 |
| Variance | 1.3477622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.6 | 201 | 3.5% |
| 6.8 | 199 | 3.4% |
| 6.5 | 193 | 3.3% |
| 6.2 | 192 | 3.3% |
| 7.4 | 190 | 3.3% |
| 7.2 | 187 | 3.2% |
| 7.3 | 183 | 3.2% |
| 7.1 | 182 | 3.1% |
| 6.7 | 181 | 3.1% |
| 7 | 176 | 3.0% |
| Other values (71) | 3399 | |
| (Missing) | 523 | 9.0% |
| Value | Count | Frequency (%) |
| 1.5 | 1 | < 0.1% |
| 1.6 | 1 | < 0.1% |
| 1.7 | 3 | |
| 1.8 | 1 | < 0.1% |
| 1.9 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 2.1 | 2 | < 0.1% |
| 2.2 | 2 | < 0.1% |
| 2.3 | 6 | |
| 2.4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.6 | 2 | < 0.1% |
| 9.5 | 1 | < 0.1% |
| 9.3 | 3 | 0.1% |
| 9.2 | 3 | 0.1% |
| 9.1 | 2 | < 0.1% |
| 9 | 10 | 0.2% |
| 8.9 | 6 | 0.1% |
| 8.8 | 17 | |
| 8.7 | 24 | |
| 8.6 | 33 |
imdb_votes
Real number (ℝ)
MISSING 
| Distinct | 3831 |
|---|---|
| Distinct (%) | 72.7% |
| Missing | 539 |
| Missing (%) | 9.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23407.195 |
| Minimum | 5 |
|---|---|
| Maximum | 2268288 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.5 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 52 |
| Q1 | 521 |
| median | 2279 |
| Q3 | 10144 |
| 95-th percentile | 115896.1 |
| Maximum | 2268288 |
| Range | 2268283 |
| Interquartile range (IQR) | 9623 |
Descriptive statistics
| Standard deviation | 87134.316 |
|---|---|
| Coefficient of variation (CV) | 3.7225441 |
| Kurtosis | 202.17161 |
| Mean | 23407.195 |
| Median Absolute Deviation (MAD) | 2107 |
| Skewness | 11.305849 |
| Sum | 1.232857 × 108 |
| Variance | 7.592389 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43 | 11 | 0.2% |
| 25 | 11 | 0.2% |
| 172 | 9 | 0.2% |
| 14 | 9 | 0.2% |
| 30 | 9 | 0.2% |
| 74 | 9 | 0.2% |
| 38 | 9 | 0.2% |
| 48 | 8 | 0.1% |
| 6 | 8 | 0.1% |
| 35 | 8 | 0.1% |
| Other values (3821) | 5176 | |
| (Missing) | 539 | 9.3% |
| Value | Count | Frequency (%) |
| 5 | 4 | |
| 6 | 8 | |
| 7 | 6 | |
| 8 | 5 | |
| 9 | 6 | |
| 10 | 3 | 0.1% |
| 11 | 6 | |
| 12 | 5 | |
| 13 | 5 | |
| 14 | 9 |
| Value | Count | Frequency (%) |
| 2268288 | 1 | |
| 1994599 | 1 | |
| 1727694 | 1 | |
| 1472668 | 1 | |
| 1346020 | 1 | |
| 989090 | 1 | |
| 945125 | 1 | |
| 795222 | 1 | |
| 748654 | 1 | |
| 723306 | 1 |
| age_certification | df_index | imdb_score | imdb_votes | release_year | runtime | seasons | type | |
|---|---|---|---|---|---|---|---|---|
| age_certification | 1.000 | 0.217 | 0.242 | -0.303 | 0.233 | -0.728 | 0.079 | 0.999 |
| df_index | 0.217 | 1.000 | -0.179 | -0.278 | 0.962 | -0.120 | -0.541 | 0.178 |
| imdb_score | 0.242 | -0.179 | 1.000 | 0.219 | -0.137 | -0.178 | 0.150 | 0.331 |
| imdb_votes | -0.303 | -0.278 | 0.219 | 1.000 | -0.159 | 0.240 | 0.286 | 0.038 |
| release_year | 0.233 | 0.962 | -0.137 | -0.159 | 1.000 | -0.141 | -0.512 | 0.119 |
| runtime | -0.728 | -0.120 | -0.178 | 0.240 | -0.141 | 1.000 | -0.251 | 0.805 |
| seasons | 0.079 | -0.541 | 0.150 | 0.286 | -0.512 | -0.251 | 1.000 | 1.000 |
| type | 0.999 | 0.178 | 0.331 | 0.038 | 0.119 | 0.805 | 1.000 | 1.000 |
| df_index | id | title | type | release_year | age_certification | runtime | genres | production_countries | seasons | imdb_id | imdb_score | imdb_votes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | ts300399 | Five Came Back: The Reference Films | SHOW | 1945 | TV-MA | 48 | ['documentation'] | ['US'] | 1.0 | NaN | NaN | NaN |
| 1 | 1 | tm84618 | Taxi Driver | MOVIE | 1976 | R | 113 | ['crime', 'drama'] | ['US'] | NaN | tt0075314 | 8.3 | 795222.0 |
| 2 | 2 | tm127384 | Monty Python and the Holy Grail | MOVIE | 1975 | PG | 91 | ['comedy', 'fantasy'] | ['GB'] | NaN | tt0071853 | 8.2 | 530877.0 |
| 3 | 3 | tm70993 | Life of Brian | MOVIE | 1979 | R | 94 | ['comedy'] | ['GB'] | NaN | tt0079470 | 8.0 | 392419.0 |
| 4 | 4 | tm190788 | The Exorcist | MOVIE | 1973 | R | 133 | ['horror'] | ['US'] | NaN | tt0070047 | 8.1 | 391942.0 |
| 5 | 5 | ts22164 | Monty Python's Flying Circus | SHOW | 1969 | TV-14 | 30 | ['comedy', 'european'] | ['GB'] | 4.0 | tt0063929 | 8.8 | 72895.0 |
| 6 | 6 | tm14873 | Dirty Harry | MOVIE | 1971 | R | 102 | ['thriller', 'crime', 'action'] | ['US'] | NaN | tt0066999 | 7.7 | 153463.0 |
| 7 | 7 | tm185072 | My Fair Lady | MOVIE | 1964 | G | 170 | ['drama', 'music', 'romance', 'family'] | ['US'] | NaN | tt0058385 | 7.8 | 94121.0 |
| 8 | 8 | tm98978 | The Blue Lagoon | MOVIE | 1980 | R | 104 | ['romance', 'drama'] | ['US'] | NaN | tt0080453 | 5.8 | 69053.0 |
| 9 | 9 | tm119281 | Bonnie and Clyde | MOVIE | 1967 | R | 110 | ['drama', 'crime', 'action'] | ['US'] | NaN | tt0061418 | 7.7 | 111189.0 |
| df_index | id | title | type | release_year | age_certification | runtime | genres | production_countries | seasons | imdb_id | imdb_score | imdb_votes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5796 | 5796 | ts286386 | The Big Day | SHOW | 2021 | TV-MA | 45 | ['reality', 'romance'] | ['US'] | 2.0 | tt13887518 | 4.6 | 327.0 |
| 5797 | 5797 | tm985215 | Princess 'Daya'Reese | MOVIE | 2021 | NaN | 115 | ['romance', 'comedy'] | ['PH'] | NaN | tt13399802 | 7.2 | 45.0 |
| 5798 | 5798 | tm1004011 | Time to Dance | MOVIE | 2021 | NaN | 107 | ['drama', 'romance'] | ['IN'] | NaN | tt8622232 | 2.2 | 950.0 |
| 5799 | 5799 | ts307884 | HQ Barbers | SHOW | 2021 | TV-14 | 24 | ['comedy'] | ['NG'] | 1.0 | NaN | NaN | NaN |
| 5800 | 5800 | tm1040816 | Momshies! Your Soul is Mine | MOVIE | 2021 | NaN | 108 | ['comedy'] | ['PH'] | NaN | tt14412240 | 5.8 | 26.0 |
| 5801 | 5801 | tm1014599 | Fine Wine | MOVIE | 2021 | NaN | 100 | ['romance', 'drama'] | ['NG'] | NaN | tt13857480 | 6.9 | 39.0 |
| 5802 | 5802 | tm1108171 | Edis Starlight | MOVIE | 2021 | NaN | 74 | ['music', 'documentation'] | [] | NaN | NaN | NaN | NaN |
| 5803 | 5803 | tm1045018 | Clash | MOVIE | 2021 | NaN | 88 | ['family', 'drama'] | ['NG', 'CA'] | NaN | tt14620732 | 6.5 | 32.0 |
| 5804 | 5804 | tm1098060 | Shadow Parties | MOVIE | 2021 | NaN | 116 | ['action', 'thriller'] | [] | NaN | tt10168094 | 6.2 | 9.0 |
| 5805 | 5805 | ts271048 | Mighty Little Bheem: Kite Festival | SHOW | 2021 | NaN | 0 | ['family', 'comedy', 'animation'] | [] | 1.0 | tt13711094 | 8.8 | 16.0 |